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by 

Shanti  S.  Gupta,  Purdue  University  and 
Deng-Yuan  Huang,  Academia  Sinica,  Taipei,  Taiwan 

I.  IiUroductiori 

In  most  practical  situations  to  which  the  analysis  of  variance  tests 
are  applied,  they  do  not  supply  the  information  that  the  experimenter  aims 
at.  If,  for  example,  in  one-way  ANOVA  the  hypothesis  is  rejected  in 
actual  application  of  the  F-test,  tne  resulting  conclusion  that  the  true 
means  0-j , * • • • drc  not  a11  equal,  would  by  itself  usually  be  insufficient 

to  satisfy  the  experimenter.  In  fact  his  problems  would  begin  at  this  stage. 
The  experimenter  may  desire  to  select  the  “best11  population  or  a subset  of 
the  "good"  populations;  he  may  like  to  rank  the  populations  in  order  of 
"goodness"  or  he  may  like  to  draw  some  other  inferences  about  the  parameters 
of  interest. 

The  extensive  literature  on  selection  and  ranking  procedures  depends 
heavily  on  the  use  of  independence  between  populations  (block,  treatments, 
etc.)  in  the  analysis  ol  variance'.  In  practical  applications,  it  is 
desirable  to  drop  this  assumption  of  independence  and  consider  cases  more 
general  than  the  normal. 

Our  interest  is  to  derive  a method  to  construct  locally  host  (in  some 
sense)  selection  procedures  to  select  a nonempty  subset  of  the  k populations 

*Thi s~research  was  supported  by  the  Office  of  Naval  Research  Contracts 
N00014-67-A-0226  0014  and  N00014- 7h-c-04!i!'.  at  Purdue  University.  Reproduction 
in  whole  or  in  part  is  permitted  tor  any  purpose  of  the  United  States 
Government. 
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containing  the  best  population  as  ranked  in  terms  of  (k's  (defined  below) 
which  control  the  size  of  the  selected  subset  and  maximizing  the  probability 
of  selecting  the  best.  We  also  consider  the  usual  selection  procedures  in 
one  way  ANOVA  based  on  the  generalized  least  square  estimates  and  apply  the 
method  to  two  way  layout  case.  Some  examples  are  discussed  and  some  results 
on  comparisons  with  other  procedures  are  also  considered. 


Locally  Best:  Selection  Procedures 


Let  ■ i j,  represent  k ( • ?)  populations  and  let  j , . . . , X.^ 
be  n.j  independent  random  observations  from  v .■ . The  selection  procedures 
will  depend  upon  the  observations  through  T\j  which  are  defined  as  follows. 

Let  T.  j * T(xii ’•  • • ’X.jn  ; ) be  based  on  the  n^  arid  n- 

observations  from  . and  r-  (i,j  • l,2,...,k),  respectively.  In  a given 

problem  the  function  T is  so  chosen  as  to  indicate  the  differences  between 

the  populations  in  a reasonable  wav.  Tor  example,  if  the  observations 

drawn  from  ^ are  norma  I i / distributed  with  unknown  mean  o.,  (1  <_  i y k), 

and  knov.n  var  iance  a choice  of  I.  might  be  X--5L,  where 

„ ij  ‘ > J 


l. 


I A. 


j 

> X 


Now  we  assume  that  T—  has  a joint 


) X.  and 

' n i . • 1 1 ' •'  nj  . 1 

probability  density  function  g (•)  depending  on  the  parameter  • and  assume 

' i i 

that  ■'j1's  are  known.  Usually  K-'s  are  chosen  to  obtain  both  sufficient 


and  maximal  invariant  statistics  for  i ^ ■ 


Let 


min 

jf' 


iJ 


Return ing 


to  the  above  normal  means  problem,  we  find  that  ~ 0^-o.j  and 


ri  ~ 


[k]  for  Ji 


' [ k ] aru 

'’[1]  •••'  ,![H- 

A population  is  said  to  be  best  if  i- 
normal  means  example. 


. * * - ~ 1 1 r [ ii  for'  • ? * 

l r | k - 1 j r 


"j-kj,  where 


. is  best  it  >>■ 

i i 


max  i . . for  the  above 
1 i-  k 1 

, i and  i n th  l s case  . • 0 

k J ii 


and  1 i " f’[kJ"'(k-lj- 
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Let  5i  = .1  j 1 k,  j t i),  c]  = (TifjTifc  = T^.,  1 < p.  <k,«f  i) 

and  2 ^ = ( z ^ j i 1 • j ' k,  j f i)  are  all  k-1  dimensional  vectors.  Assume  that 

the  joint  density  of  T. . , j = 1 ,2,. . . ,k,  j j i , is  f (z, ) , 1 < i < k,  (with 

-i 

respect  to  some  o-finite  measure  u).  Let  -i-  be  the  probability  of  selecting  n • , 


Sf  = IS:  t = (6 


r--"5k)’  Is < 


s;  = {5:  <5  = (-5,  , V /^(z-Jf  i(zi)dy(zi)  - r). 

i ;;  1 > ■ 

- 1 

Theorem:  Let  <5^  = (t® be  defined  by 


1 if  mir,  -rt  - f.  (z.)j  . 
1'  j-  k "ij  Li  '■>  rj 
! * i 


cfri(?d- 


Then  6°  maximizes  } min  /^(z-)  f.  (z-)|  i di-(z^)  among  all  rules 


i = l jfi 


1J  -1  -’i 


>,  € S^.  i s is  called  a locally  best  procedure  in  this  sense 


Proof.  For  any  • C 3^ , 


/ (?, ) i!i  i n f (/. ) | • d:.(zj 

v-1  jji  u i t 


, i ■;< 


z jmin  f.  I.’-,):  i < 1 1 ■ ( z 1 ) 

.1 1 i n -i 


V /[  ‘ :(?  i )■■■?(’  j ) If11''"  - f-  (?j)l  rrf  i (i'-j ) jd,.  ( z i ) 

i-i  i!i  'J 


f[-  i (?!  j)J»  ) ()- 


This  proof  is  complete 
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k r ~ % (x.-o.)2 

Example:  Let  g0(x)  - n gfl  (x-),  where  g (L)  - ~ e 1 . Let 

- j-1  i i /2i: 

Tij  = i"  j’  1 - J k’  J H-  - 0,  and  Z.  ■ - j i i.  We  know 

that  a maximal  invariant  under  a group  G is  T.  - (JF , . . . ,X  . ) where 
G is  the  group  of  transformations 

9?i  = (2-j  j + C,.  . . ,Z-y  + c),  - - c • 

which,  in  the  parameter  space,  induces  the  transformat  ions  g t . . = • +c. 

^ J 1 J 


Since 


(k-l ) x ( k- 1 ) ' n 


^ “ ( 


) is  the  covariance  matrix  of  7..'s.  We 

U 


know  that  : is  positive  definite,  and 


-1  n 

' F 


Hence 


= (2:;)  ■ :n’  ^v.J)[((M)(zir,il)^(7il-tii)Ui2-,i2) 

^iT’ir  (zik‘  ik})  +---+  ' j ! “ ' i 1 1 ?ik* : i k^ 


..J. f (7  ) ! - (?  1 

j,-.  ■ T r . - 1 . '•  1 

1.  -1  hi 


;•  '•  Pxp[  (-£)[(  (k-l);^ 


“zi  lzi2  -•••-  znziL.)  + --+  1 •-  n 7 


i r i k 


i (k-l  )* ik 


+ (k-Dz-J]>  • !-22;1  -•••-  2zi(i.nf2(k-l)2;:-Pz, 


i j " ' " i ( j f 1 ) 


2zik‘- 


Hence 


or 


'?l?i> 


1 if  min[- 
Jfi 


k 

y 

-fi  .j 


7,,,-Kk-Dz^]  > 


> C, 


( 0 


1 i f max 


1- j-  k 


X . 
J 


c, 


f i 


3.  Usual  Approach  to  Selection 
Problem1,  in  One  W ay  Layout 


Let  tt -j 

* O « * * * > 

r ^ he  k populations,  let  X . -j , . 

. . ,X.  denote  n. 

’ ini  i 

independent 

observar. 

ions  from  the  ith  population 

. Let  the  joint  density 

" 'll 

Ir^  ' 'L  l 

' ’X?n0”  ’ ' ; Xkl  ’•  ’ ' ,Xknk  be 

of  the  following  form: 

• "5.1/ 

1 1 

r i ( ( x. - 1 , ) ' \ ^ (x-^  ' ) 

nl  nk 

where  <’  r 

1 v I r • ' ' 

' 1 , n ( ' ; xkl * xknv ^ 

("1  "1  " ‘ ' ; ; k ’ ' ' ‘ *"k 

and  i*.  a 

known  |u, 

itive  definite  matrix  and  is 

. determineo  such  that 

(3.1;  is  a 

dens  i t> . 

Let  ... 

_ . 1 

i ■ ,o.  ) : and  also,  let 

, y 

K 

where  r = with  n-  components  and  [ n.  = N > k. 

We  consider  the  analysis  of  variance  problem  in  a one  way  layout; 


'll  ' Veij’  J*1 V ^ k 


which  is  in  the  form  of  the  general  linear  model 


x = Ae  + e. 


where  e‘  = (e^,...,e^n  . ;ek^ , . . . ,ekn  ) with  var(e)  = A. 

t k 

We  know  that  the  generalized  least  square  estimator  of  g is 


= (A’a'1A)‘1A,a"1; 


Since  g = Bx,  8 = (A ' a" 1 A) _1A ' 1 , the  joint  density  of  Y’  = (Yj,...,Yfc) 
is  of  the  form 


(3.2)  bk | A1 i’n  h((y-a)'A'1(y-g)) 


where  A^  = BaB'  = (0^). 

The  ordered  o-'s  are  denoted  by  Oj-^j  :-•••_  ’'[|cj-  assumed  that 

there  is  no  prior  knowledge  of  the  correct  pairing  of  the  ordered  and  the 
unordered  o^'s.  Let  us  denote  by  the  population  (unknown)  associated 

with  0 ^ j , i = l,2,...,k.  Our  goal  is  to  select  a non-empty  subset  of  the 
k populations  so  as  to  include  the  population  associated  with  Defining 

any  such  selection  as  a correct,  selection,  we  wish  to  define  a procedure  R 
so  that  P(CS | R) , the  probability  of  a correct  selection,  is  at  least  a 
preassigned  number  P*(-£  P*  - 1 ) . We  will  refer  to  this  requirement  as 
the  P*-condi tion.  We  propose  the  following  rule  R based  on  Y . , 1 • i ' k. 
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R:  Retain  . in  the  selected  subset  if  and  only  it 


Y.  • *xk  (Vrc^liV-jr2,jj), 

where  c - c(k,P*;  n.  ,n. . ,1  i , j<k)  :■  0 is  chosen  so  as  to  satisfy  the 
P*-condition . 

Let  and  denote  the  observation  and  the  variance  associated 

with  the  population  with  mean  i = l,2,...,k.  Of  course,  both 


and  are  unknown  as  in 

Y( j ) ' Th“S 


• ^ , the  covariance  of  Y^  and 


(3.3)  P(CS|R)  = P Y((r;)  , ima^(Y(jrc4(k)(k)+a(j)(j)-2o(k)(j))} 


= P{"jk  - L + (:'[kj"'J[j])(o(k)(k)+a(j)(j)-^(k)(j) 


,-\P»  1 < j < k-1  } » 


where  for  s.,  1 i < k,  we  define 


(3‘4)  Zrt  - (Y(r)-Y(;.)-0Cr]+f,[,])(a(r)(r)+o(t)(O'2o(r)(A))4’ 


for  r = l,2,...,k,  r f 

Let  Zf  = Y Af,  where  Zf  = ( Zff : r = l,2,...,k,  r f i. ) and 
- = (Y(l)'u[l],'--’Y(k)"e[kj)- 

The  matrix  A with  k rows  and  (k-1)  columns  is  defined  as  follows: 


01  ,,c.  - ('-  f r)  / r)r 


n " u'  (r)(r)  JU)(y)~  ir)(P 


-2a,  . , I < r,  < k,  r =f  i; 


(3.5)  A;  = 


0 0 


'al  i "a2x 
0 0 


0 0 


i- 1 , fc 

Vl,*  "Vl,». 

0 Vl.Jl 
0 


o \ 


k,t  ' 


Since  A^A,,  - (;  • i,  j - l,2,...,k;  i,  j =f  £.,  for  1 <_  i,  < k and 


Af  is  of  rank  k-1,  hence  the  joint  density  of  Z,  is 


(3-6)  { z t ) = ak|A;7.1A  J"-;  f(?‘ (A-.:1Av  )"1zz)- 


For  any  given  association  between  (o^-,  i,  j=l,2,...,k)  and 


i > j = 1,2,... ,k)  we  can  see  from  (3.3)  that  the  infimum  of  P(CS|R 


is  attained  when  n 


[1]  [k] 


Hence, 


(3.7)  inf  P(CSiR)  = min  P{7  - c,  r=l ,2, . . . ,k ,rfi;  (p| ^ } }. 


0 0:2 


1 -;<k 


ij 


For  f < k,  let  be  such  that 


(3.8) 


k 

(i)  < ’ j > K-i j * i,  jft,  i,  jfk ; 

(ii)  kJj,  3 = 1,2,.  ..,k,  jft,  k. 


and  for  any  (1  <_  y.  •'  k),  there  exists  an  m,  (1  m k),  such  that 
for  any  i,  j,  i,  j*l,2,...,k,  i,  ju,  i?j, 

J'-)  - . (m) 

ii  Kr,s 

for  some  r,  s,  rfs,  r,  sfm,  r,  s = l ,2,. . . ,k. 


Lemma  [3], 


If  X 1 - (X^,...,X  ) has  density 


then  for  any 


two  positive  definite  (symmetric)  pxp  matrices  - (r^)  and  r.,  = (o^) 
such  that  r^  = o^,  1 ; i < p and  r^  1 i < j < p. 


Pr  f X, 
' 1 1 


Pr.  tx 


9.  ) 

P 


for  any  real  numbers 


• • i 


9 


10 


ij}  " '!•  ,i-n'(1) 


for  i * j,  i,  j t Lot 


1 


(1  * „ < 
i ) 


'<  i ) 


i ; 


(.3.  iU) 


1 J *!lr  : -(  1 l!,[  1 1 

ii![  i ] "‘[,]  j 


, i U.  j | 


Then,  it  it  oat/  to  chock  that  the  condition  (3.8)  it  satisfied 

Expected  subset  size  tor  a special  case 

Let  the  joint  density  p as  in  (3.2)  have  th°  form 


(3.11) 


P ( * ) h(v';'‘y; 


where  P°r  i>we  definite  with  ^ 

V 


lv 


when  i t j,  • and  , are  known.  Let  $ be  the  size  of 
subset  excluding  the  i-er t.  population.  Then  the  expected  subset 

given  by 


K-  t 

E ( S ! R ) - r>  P. Selecting  ^.JR) 
i i 111 


k-1 

) PiY/.  . ■ max  Y , • s -c  ; >r?T\-»)  1 
i ' - 1 ' m,  k (j) 

k-1  i , \ •>  j \ 

l fB  k.  hl{A]  )l"'  Uz'i  \\  u-.)(i2-. 

ir  I ■ 


where 


M'V) 


)[?  ‘ (1-.  ; )'-  , cl,?,  . . ,k,  rj.  , 


B +_i,  - (?r  , - c+f  'j- } j - j r j ) | ? ' ( l - , ) j , 


r - 1 k , r j . , 


k. 


and  n | j ^ defined  as  in  (3.6)  is 


and 

the  selected 
size  is 


1? 


Remark:  Let  p be  the  multivariate  normal  density  as  in  (3.11),  then  f 

has  the  required  property. 


3.1.  Appl ications  to  Normal  Populat ions 


Let  ,tt„,  . . . ,:ir  be  k independent  normal  populations  with  means 

9 y 2 2 2 2 

j 1 , uk  and  variances  o'j.o.,, . . . respectively.  Let  =...=  >jy  - a , 


where  o'  may  or  may  not  be  known. 


Case  (a):  oL  known.  We  assume  without  any  loss  of  generality  that  a - l, 

and  for  this  problem  ( 3.2 ) assumes  the  following  form: 

_ k 

p(x)  = (20  h(  (y-;_. ) ' O’ Oy-,. ) ) , 

where  y ' = (u-j , • . . ,;.k) , h(x)  ••  e , and  D diag(n1  ,...,nk  ). 

Gupta  and  Huang  [6]  proposed  the  following  rule  based  on  the  sample 
means  Y..  from  it.,  i = 1,2,...,k. 

R-| : Retain  t:  . in  the  selected  subset  if  and  only  if 

Y.  > max  (Y.-c,/  ' * — ) , 

1 _ ! 1 "j 

where  = C| (k,P*,n, , . . . ,nk)  > 0 is  chosen  so  as  to  satisfy  the  P*-condition. 
For  the  condition  (3.10),  ^ I impiies  inf  ■ n-\  1 < i < k.  Therefore 


any  i,  1 < c •;  k, 


y-ii  = [(!  + 1 J'^)'P,  i f J.  i J t t. 


i k,  i | 


Let  n,  = ( 1 + :r^P  , i = 1 , 
1 " l i 1 

and  <ku  = 1,  1 ' i •;  k-1. 


k-1.  Thus  c i ^ ■ iyij,  i tj,  i , j = 1 , .k-1 


By  (3.7),  we  have 


(3.12)  inf  P(CS | R) 


*/  •••  / (2.)'Zl(.)j>r^f(z'(,5jr,zk)dz,k. 


k-1 ,k’ 


where  Z^,...,7^_j  ^ are  standard  normal  random  variables  with  correlation 
>rS  = ‘y^s’  ^ s *cnown  that  "lk’.’.’^k-l  k can  generated  from  k independent 

standard  variates  Y, , . . . ,Y^_i  ,Y  by  the  transformation 

V = W- 

and  then  (3.9)  is  as  follows: 

(3.13)  inf  P(CS,R. ) - / k;,'  ■:  ('---+%)« (u) . 

ri  O-fY 

O O o 

Case  (b):  n unknown.  Let  s denote  the  usual  pooled  estimate  of  nc  on  j 

degrees  of  freedom.  Gupta  and  Huang  [6]  proposed  the  rule  R?  tor  selecting 
a subset  containing  the  population  associated  with  the  largest  ,j^'s. 

R„:  Retain  . in  the  selected  subset  if  and  only  if 

Y-  max  (Y.-c0s  / — + 

' 1 , 1 1 • - "i  "i 

where  c,  = rp(k,P*,n^,. . . ,n^)  ■ 0 is  to  be  determined  so  that  the  P*-cond i tior 
is  satisfied. 

Using  the  same  argument  as  in  case  (a),  we  ran  obtain 

■ k-1  C0U-(  -X 

(3.14)  inf  P(CSJR0)  = / / ..  tf  " ]dt(x)dO  (u) 

o - ■■  ,i  i "> 

where  Q (u)  denotes  the  edf  of  a A /*  variate. 
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The  Evaluation  of  the  Constant  c 1 Associated  with  R -| 

Let  . ,U|(_-|  be  k-1  standard  normal  random  variables,  and  the 
correlation  coefficient  of  and  LG  be  p,  i,j  = l,...,k-l,  where 

U = C(1  + nJ-kl)  (1  + 

"m  nm 

k 

Using  the  same  notation  as  before,  we  have  *..•  > p,  i , j = hence 

P(Zik  < c,  i • '^j)l 

>.  P 1 < c , i = 1 , . . . , k- 1 , if.  j i 

= f ♦k-1(^fi^)d*(u). 

-w  v'T 

Equating  the  above  integral  to  P*,  values  of  c are  available  for  the  equi- 
correlated  LG's  from  the  tables  in  Gupta,  Nagel  and  Panchapakesan  [7]. 
These  c-values  will  be  greater  than  the  exact  c-j-values  satisfying  the 
equations  by  equating  the  left  hand  side  of  (3.13)  to  P*.  Some  of  the 
exact  c^ -values  can  be  obtained  from  Taole  1 of  Gupta  and  Huang  [6]. 


Some  Results  on  Comp arisons 


Assume  that  cL  - 1.  The  procedure  of  Gupta  and  D.  Y.  Huang  [6] 
is  more  efficient  than  Gupta  and  W.  T.  Huang  [5]  for  the  case  of  k = 2, 

n j- 1 -j  - 0 < ft  < 1 . 

For  v unknown,  Chen,  Dudewicz  and  Lee  [2],  have  proposed  a class  of 

procedures  as  follows: 


R : Retain  -t . in  the  selected  subset  if  and  only  if 

<1  1 


V /I  , 1 

i ■ ,nax  n ‘ > 

' 1-jk  ' a • ni  " 


15 


where  a is  any  fixed  constant  such  that  0 a < •». 

For  any  fixed  P*,  £ < P*  < 1 , and  k = 2,  f n^. 


inf  P(CS|R  ) 

ii  a 


yOT*  T 

f<p( — q.,x)dQ  (x) 

/T~T'  d v 

/ n,  n? 


inf  P ( C S | R i --  / 1 ( c i x ) dQ  ( x ) = P*, 

hence  c,/-^--  = q f e 1 . Since 

1 n-|  in,  n ^ j a 

2 v 

sup  E ( S j P.  ) - sup  v>  P(Y.  ■ max  Y.-q  s . + -) 

d i = l ' ' I < j • 2 > " v ni  d 

> sup  } P(Y.  max  Y.-q  s / -----  + -) 

..  ih  1 ],J  2 J d v n[2)  •' 

2 , n =r 

= sup  ) P(Y.  max  Y-c-iS  v + — ) 
i=l  1 1 j • 2 J 1 0 ni  "? 

--  sup  E (S 1 ) . 

4.  Selection  for  Small  Variances  of  Normal  Populations 

Let  p k denote  k independent  normal  populations  with  unknown 

2 2 2 

variances  o-j,  • respectively,  (<>.  > 0,  i ~ l,2,...,k),  and  with  all 

n O 

means  known  or  unknown.  The  ordered  variances  are  denoted  by  <...■  ik]' 

It  is  assumed  that  there  is  no  a priori  information  available  about  the  correct 

9 

pairing  of  the  given  populations  arid  the  ordered  parameters  The  population 

2 

with  variance  equal  to  is  called  the  best  population.  The  goal  is  to 
select  a non  empty  subset  of  the  k populations  containing  the  best  population. 

Any  such  selection  will  be  called  a correct  selection  (CS). 
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2 2 2 2 
Let  s^  .s^. • • • ,s^  denote  the  sample  variances.  Let  s^  denote  the 

(unknown)  sample  variance  that  is  associated  with  the  ith  smallest 

2 

population  variance,  let  denote  the  number  of  degrees  of  freedom 

associated  with  . Gupta  and  Sobel  [8]  have  proposed  a procedure  for 
this  goal.  Gupta  and  Huang  [6]  obtained  a lower  bound  on  the  infimum  of  the 
correct  selection.  We  modify  Gupt.a-Sobel  procedure  to  obtain  exact  results  to 
satisfy  P*-condi tion  asymptotically  and  apply  the  method  of  Section  2 to 
obtain  an  optimal  procedure. 

R,,:  Retain  -n . in  the  selected  subset  if  and  only  if 

3 i J 

('-  * M 

s-  < min  [(^-)  1 3 s*], 

" lij;  k 3 J 

where  c^,  (0  • c3  ' 1),  is  the  largest  value  satisfying  the  basic  P*-condition. 

We  shall  show  how  large  sample  theory  can  be  used  to  find  very  good 
approximations  to  the  required  probabilities  even  for  relatively  small  n. 

Our  principal  tools  will  be  the  use  of  the  transformation  y - logps  (see  [1]), 
and  the  approach  of  certain  multivariate  distributions  to  multivariate  normal 
distributions. 

0 
r c. 

Let  X.  = logp  i = l,2,...,k.  It  is  known  (see  [8])  that  the 

M , 

1 

expectation  and  variances  are 


Var(Xi-X.)  + :;■>  for  j = 1 ,k ; j f i . 

j ■*  j •’ j 


/ i . , l 

(>>-)•  log  ~2 


is  asymptotically  distributed  as  a standdrd  normal  variable  Z-  as  v. 
Since 


(^)  i 

c3 


,L , L)f 


vi  -J- 

j 2 , 

o • 

J 


,k,  j 4 i 


is  equivalent  to 


1_  _ 1_ 

zii  <~l  i°g  r + 7~~r  ' j = k’  H u 

J & 3 yT  + 2 


' V.  Vj 


where  = ( X • -X  - + ---  - + 2-}'^  , for  1 ■ i,  j • k,  i f j,  and 

U 1 J V.  Vj  V*.  j 


/(I,  vja)(,ri«) 

v[i]  v[i] 


rik  rjk’  ’-J  - 1. •••>-!>  i i J 


Hence  we  can  apply  the  results  in  Section  i to  prove 


- k-l  ckTriku 

inf  p(cs|r,)  « / ;;  ■!(  -kJ  J y)Ji(u), 

-j-l  (l-r;jk)> 


where 


C,  , = loq  - , j-l k-1. 

kJ  3 j T f 2 

’ Ik]  lj] 
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It  should  be  pointed  out  that  for  equal  sample  size  case,  Gupta  and 
Sobel  [8]  compared  the  exact  value  and  the  asymptotic  values  of  the  constant 

c7  to  see  how  close  they  are. 

d v 2 

• S ■ 

For  any  i,  1 i «-  k,  let  ^ -•  | and  T-j  ■ i for  1 j k,  j * i. 

i ' i 

We  can  find  the  joint  density  of  T-j,  j - l,2,...,k,  j f i.  We  can  con. true: t 
an  optimal  procedure  hased  on  Tj-’s  usinq  the  method  of  Section  2. 

5.  Sel  ec_tion_Procedures _ in  Two-Way  Layouts 

Let  7i, , » ''k  be  k populations.  For  a two-factor  complete  block 
design  with  one  observation  per  cell, we  express  the  observable  random 
variables  X-  (i  = l,2,...,k;  . - l,...,n)  as 

i i 


(5.1) 


X . = U + t + l ; + S -j  i > 7 • = 0 , 

it  > i it  i ' ■ 1 1 


where  u is  the  mean-effect,  i-. i , . . . , c ■ are  the  block  effects  (nuisance 
parameters  for  the  fixed  effects  model  or  random  variables  for  the  mixed 
effects  model),  are  the  treatment  effects,  and  the  are  the 

error  components.  Let  denote'  the  n independent  observations 

from  the  ith  population  n ^ . Let  the  joint  density  ol  X^,...,X^n;  X,^,..., 
X0.  ; . . . ; X. X . „ be  of  the  following  form: 


(5.2) 


c . t A : g(  x-'_> ) ' A (x-o)) 


where  x'-  ( x^  j , . . . ,xJn ; . . . ;xfc  j , . . . .x^) , and  u'-  (']  7 • • • • »uin;  • • • ;i)kl  ’ ' * ‘ ’"l 

0.  = ■.  f j l,?,...,k;  < - l,...,n.  and  : is  a known  positive 

la  1 1 

definite  matrix,  c^  is  determined  such  that  (5.L)  is  a dens i tv. 

Our  purpose  is  to  study  some  selection  procedures  to  select  a subset 
of  a random  size  containing  the  "be  t " treatment  . I no  quality  ot  the 


? 

I* 

i 
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treatment  is  judge  by  the  largeness  of  the  r^'s. 


Let  t 


[1] 


[k] 


be  the  actual  ranked  t's  (which  are  unknown),  and 


let 


, n k 

Z.  = X.-X  where  X-  = --  ; X-.,  i = 1, k;  and  X = £ 7 X. . 

1 n j-|  ’J  k i 


We  denote  the  ordered  values  of  the  Z.'s  by  Zj-^  <■....<  Zj-^j  and  let 
be  the  random  variable  associated  with  [j-.j,  i ~ l,...,k. 

By  a similar  argument  as  in  Section  3,  we  know  that  Z.  is  the 
generalized  least  square  estimator  of 

Let  Z = EY,  where  E'  = (E] , . . . >Ek)knxk  with  rank  k,  Y'  = (X-, X1m; 

X.  X.  ),  and  E'  = (-  1 


1 


kl 

1_  _ 1_ 
kn*‘ * ' kn 


kn'  ’ u,,u  Li  ’ kn’-'”"  kri n (1'  ' ’ ’ * (1‘  ^ ’ 

, for  1 < i <•  k. 


11 "In1 

i (i-  1 

n u k' 


Then  the  joint  density  of  Z^,...,7-k  is  of  the  form: 


(5.3) 


V-'  ' 


• hUz-O’y-^z-t)) 


where  z'  = (z,,...,zj,  t’  = ,r.  ) and  ",  - E > V-  {v . ■) . 

K ' 1 K kxk  ,J 

The  methods  to  construct  selection  procedures  are  the  same  as  in 
Section  3. 
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Key  WOFJOS  .'CoiirinU#  on  r«v«r«*  «M«  ti  n# 


mry  m\d  Idmntitv  by  MocA  number; 


locally  best,  selection  procedures,  correct  selec  tion,  general izeci  JL5  [ 

estimates,  Schur-concave  fund  ions,  unequal  sample  si^es. 
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20  aMT R AC  T ( C 'jnfinu#  on  r«v»r»*  «M«  l(  ry  mrut  Identity  by  hli.r*  nt<m6«rj  j 

“In  most  practical  situat ie-s  to  which  the  analyses  of  variance  tests  are  J 
applied,  they  do  not  supply  the  information  that  the  experimenter  aiihs  at  li.l 
for  example,  in  one-way  ANOVrt  the  hypothesis  is  re  jected  in\ntual  apftl  n ation  j 
of  the  F-test,  the  resulting  cone  lus  ion  i hat  the  true  means  rt|  re  net  J 

all  equal,  would  by  itself  usually  he  in  ,uf  t ir . i »-n  L t.u  satisfy  the  exflerimenltT  ; 
In  fact  his  problems  would  begin  at  this  stage,  file  expert  met.  ter  may  desire  j 
to  select  the  ‘'hest*' popular,  inn  or  a subset  of  the  ^guod^popula  lions ; lie  may  \ 
like  to  rank  the  populations  ip  order  of  ■^Tioodneuii  or  he  may  like  to  draw  - — 


IJMIT  Y CL  AiSIFlC  « ! . .N  if  f HIS  F-AGtlWh#*!  O.I«  Btutfd, 


some  other  inferences  about  t tie  parameters  of  interest. 

The  extensive  literature  on  selection  and  ranking  procedures  depends 
heavily  on  the  use  of  independence  between  populations  (block,  treatments, 
etc.)  in  the  analysis  of  variance^  In  practical  applications,  it  is  desirable 
to  drop  this  assumption  of  ««h?pt?ndence  and  consider  cases  more  general  than 
the  norjw+r-  f a,,v-  - j;,<-  'Jf 

<~~ln  the  present  paper,  we- tier  i-ve  a method A:o  construct  locally  oest  (in 
some  sense)  selection  procedures  tu  select  a non  empty  subset  of  the  k popula- 
tions containing  the  best  population  as  ranked  in  terms  of  o.'v  which  control 
the  size  of  the  selected  subset  arid  which  maximizes  the  probability  of  select- 
ing the  best.  Jala  fill  so  consider  the  usual  selection  procedures  in  one-way 
ANOVA  based  on  the  general  iced  least,  squares  estimates  and  apply  the  method  to 
two-way  layout  case ...  Some  examples  are  discussed  and  some  results  on  compari- 
sons witli  other  procWures  ate  also  obtained. 


